Facing the Identification Problem in Language-Related Scientific Data Analysis
نویسندگان
چکیده
This paper describes the problems that must be addressed when studying large amounts of data over time which require entity normalization applied not to the usual genres of news or political speech, but to the genre of academic discourse about language resources, technologies and sciences. It reports on the normalization processes that had to be applied to produce data usable for computing statistics in three past studies on the LRE Map, the ISCA Archive and the LDC Bibliography. It shows the need for human expertise during normalization and the necessity to adapt the work to the study objectives. It investigates possible improvements for reducing the workload necessary to produce comparable results. Through this paper, we show the necessity to define and agree on international persistent and unique identifiers.
منابع مشابه
The Most Common Challenges Facing Iranian English Majors in the Translation Process from English into Persian
The main priority for university translation educators is to improve the quality and outcomes of translation courses. To achieve such a goal, the instructors are required to integrate learners' needs, identified with the help of a needs survey, into syllabus content. Accordingly, the present study was conducted to identify the Iranian English majors' difficulties in translating English texts si...
متن کاملIdentification of Problems faced by Graduates of Farhangian University Campuses in Chaharmahal and Bakhtiari Province
Identification of Problems faced by Graduates of Farhangian University Campuses in Chaharmahal and Bakhtiari Province J. Torkzaadeh, Ph.D.* M. Saadeghiaan Sooraki** R. Marzooghi, Ph.D.*** J. Jahaani, Ph.D.**** To improve any learning environment, its problems need to be identified. To do so at the Farhangian University campuses in the province of Chaharmahal and Bakhtiari Province,...
متن کاملLogical selection of potential hub nodes in location of strategic facilities by a hybrid methodology of Data Envelopment Analysis and Analytic Hierarchical Process: Iran Aviation case study
Hub facility location problem looks to find the most appropriate location for deploying such facilities. An important factor in such a problem is the pool of potential locations from which the optimal locations must be selected. The present research was performed to address two key objectives: identifying the factors contributing to the selection locations for hub establishment, and presenting ...
متن کاملIranian EFL Teachers' Language Assessment Literacy (LAL) under an Assessing Lens
Despite being trained in pre-service teacher education programs, most EFL teachers are underprepared when faced with language assessment-related activities. Part of the problem emanates from the fact that Language Assessment Literacy (LAL) as a construct has not been well defined by experts. The purpose of this study was to pinpoint the components of LAL in the Iranian EFL context using an adap...
متن کاملتحلیل موضوعی مقالات مرتبط با اعتیاد در پایگاه مدلاین به روش خوشه بندی سلسله مراتبی: 2014-1991
Introduction: Addiction, which has recently attracted the attention of researchers, is a serious problem worldwide. The growth of relevant literature contributes to a better understanding of this problem and improves the interaction between executive organizations and academic institutions. It is important to identify the active subject areas within this field and to explore the topics which ar...
متن کامل